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Abstract 


The main goal in this paper is to use a dual equivalence in automata 
theory started in [25] and developed in [3] to prove a general version 
of the Eilenberg-type theorem presented in [4]. Our principal results 
confirm the existence of a bijective correspondence between three con- 
cepts; formations of monoids, formations of languages and formations 
of congruences. The result does not require finiteness on monoids, nor 
regularity on languages nor finite index conditions on congruences. We 
relate our work to other results in the field and we include applications 
to non-r-disjunctive languages, Reiterman’s equational description of 
pseudovarieties and varieties of monoids. 


Keywords: formations, semigroups, formal languages, automata 
theory. 


1 Introduction 


An important result in the algebraic study of formal languages and automata 
is Eilenberg’s variety theorem. It establishes a lattice isomorphism between 


1Departament d’Algebra, Universitat de Valéncia; Dr. Moliner, 50; E-46100 
Burjassot (Valéncia), Spain, E-mail: Adolfo.Ballester@uv.es, Enric.Cosme@uv.es, 
Ramon.Esteban@uv.es. 

Institut Universitari de Matematica Pura i Aplicada, Universitat Politécnica de 
Valéncia, Cami de Vera, s/n; E-46022, Valéncia, Spain, E-mail: resteban@mat.upv.es. 

3Centrum Wiskunde & Informatica; Science Park, 123; 1098 XG Amsterdam, The 
Netherlands, E-mail: Jan.Rutten@cwi.nl. 

“Radboud Universiteit Nijmegen; Heyendaalseweg, 135; 6500 GL Nijmegen, The Nether- 
lands. 


172 ~~ Ballester-Bolinches, Cosme-Llépez, Esteban-Romero, Rutten 


varieties of regular languages, which are sets of regular languages closed 
under Boolean operations, derivatives, and preimages under monoid homo- 
morphisms, and varieties of finite monoids, which are classes of finite monoids 
closed under finite products, submonoids, and homomorphic images. At the 
heart of this result lie the characterisation of varieties of regular languages 
by their syntactic monoids and the closure properties of the corresponding 
sets of finite monoids. 


Several extensions of Eilenberg’s theorem, obtained by replacing monoids 
by other algebraic structures or modifying closure properties on the definition 
of variety of languages, are known in the literature. In this context, we 
mention a local version of Eilenberg’s theorem proved by Gehrke, Grigorieff, 
and Pin [9] working with a fixed finite alphabet and considering only regular 
languages on it, and the extension of this result to the level of an abstract 
duality of categories by Addmek, Milius, Myers, and Urbat [1]. 


Another further step in this research programme is to replace varieties 
of finite monoids by the more general notion of formation, that is, a set of 
finite monoids closed under taking epimorphic and isomorphic images and 
finite subdirect products. 


Formations of finite groups are important for a better understanding 
of the structure of finite groups, and the more general notion of formation 
of algebraic structures, introduced and studied by Shemetkov and Skiba 
in [26], plays a central role in universal algebra. Therefore it seems quite 
natural to seek an Eilenberg type theorem establishing a connection between 
formations of finite monoids and formations of regular languages, which are 
classes of regular languages closed under Boolean operations and derivatives 
with a weaker property on the closure under inverse monoid morphism. This 
was established in [4]. The weaker closure conditions for formations lead to 
more possibilities than for varieties as more general classes of languages can 
be described and understood. 


Our principal aim here is to extend the main theorem of [4] to the level 
of general monoids. Our results are motivated by the significant role played 
by formations of non-necessarily finite groups in the structural study of the 
groups and some interesting families of non-regular languages that have 
recently appeared in the literature. 

The main contribution of this paper is an Eilenberg type theorem 
which bijectively relates formations of non-necessarily finite monoids with 
formations of non-necessarily regular languages. This result is the most 
general correspondence known to us. Our approach is based on a dual 
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equivalence in automata started in [25] and developed in [3]. This dual 
equivalence relates two special classes of automata: on the one hand, the 
set of quotients of the initial automaton A* with respect to a congruence 
relation C C A* x A*; and on the other hand, the classes of preformations 
of languages, which are subautomata of the final automaton 2“ that are 
complete atomic Boolean algebras closed under derivatives. This result is 
ultimately based on the description of two important functors on automata, 
free and cofree, defined upon equations and coequations, respectively. The 
coalgebraic approach used in this result adds expressiveness to our treatment 
and it highlights the fundamental role of duality in algebraic automata 
theory. Furthermore, this dual equivalence generalises some results in a 
recent line of work which uses a Stone-like duality as a tool for proving 
the correspondence between local varieties of regular languages and local 
pseudovarieties of monoids [9]. We have done our best to make the paper 
self-contained and so we present in sections 2 and 3 the results on automata 
theory proved in [25] and [3] which are fundamental in the proofs of our 
main theorems. 


Section 4.1 covers various topics of formation theory. Our approach 
depends heavily on the notion of a formation of congruences, which is 
an assignment that maps every alphabet A to a filter on the set of all 
congruences on A* closed under taking kernels of monoid epimorphisms. It 
is ultimately based on the results presented by Thérien in [28, 29] describing 
*_varieties of congruences. The work of Thérien on Eilenberg’s theorem 
highlights the role of congruences on the description of varieties of finite 
monoids or regular languages. It allowed him to effectively construct and 
hierarchize several important pseudovarieties. The same triple relation 
between monoids, congruences and languages is presented here for formations. 
Section 4.2 contains our main results; we prove that there is a bijective 
correspondence between formations of monoids and formations of congruences 
(Theorem 6), and formations of languages are in a one-to-one correspondence 
with formations of congruences (Theorem 7). 


We end the paper discussing how our work relates to other results 
in the field. A first result relates our work to the Eilenberg’s result on 
formation of finite monoids presented in [4]. As a consequence, formations of 
languages are shown to be closed under Boolean operations, derivatives and 
inverses of surjective homomorphisms. Hereafter, we show an example of an 
application to non relatively disjunctive languages ({12, 30, 21]). A language 
L is relatively disjunctive if there exists a dense language intersecting finitely 
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many times on each class of the syntactic congruence associated to L. It 
has been shown in [15] that this condition is equivalent to L having a non 
relatively regular syntactic monoid, that is, a monoid not containing a finite 
ideal. We present characterizations of non relatively regular languages in 
terms of their syntactic monoids and their syntactic congruences. 

We then relate, in the finitary case, the congruence approach done by 
Thérien to the construction of the relatively free profinite monoid associated 
to a pseudovariety of monoids [2]. We discuss how this approach could help 
us to retrieve a similar situation for formations of monoids satisfying some 
conditions. We finally present a discussion on the variant of Eilenberg’s 
theorem for varieties of monoids presented in [3]. 


2 Preliminaries 


Some Results on Automata 


An automaton is a pair (X,q@) consisting of a possibly infinite set X of states 
and a transition function a: X — X4, with inputs from an alphabet A. In 
pictures we use the following notation; for x,y € X andaé A, 


@Q—-@® © aa=y. 


We also write 2, = a(x)(a) and, more generally, for the empty word 
e € A* and a word w © A* we define its transition, respectively, by 


ie Lig = Old (a): 


An automaton can have an initial state x € X, here represented by 
a function x: 1 + X, where 1 = {0}. We call a triple (X,2z,a@) a pointed 
automaton. In pictures we use an entering arrow to indicate that a state is 
initial. An automaton can also be coloured by means of a colouring function 
c: X — 2 using as set of colours 2 = {0,1}. We call a state x accepting (or 
final) if c(a) = 1. We call a triple (X,c,a@) a coloured automaton. In pictures 
we use a double circle to indicate that a state is accepting. We call a 4-tuple 
(X,x,c,a) a pointed and coloured automaton or, simply, automaton. For 
instance, in the following automaton over A = {a,b}, 


@L Oe: 


b 
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the state x is accepting and the state y is initial. 

A function h: X — Y is a homomorphism between the automata (X, a) 
and (Y, 2) if for each word w € A, h(aw) = h(x)y. An epimorphism is a 
homomorphism that is surjective, a monomorphism is a homomorphism 
that is injective, and, finally, an isomorphism is homomorphism that is 
bijective. A homomorphism of pointed automata moreover respects initial 
states. Conversely, a homomorphism of coloured automata respects colours. 
If X C Y and A is subset inclusion, then we call X a subautomaton of Y 
(respectively a pointed and a coloured subautomaton). For an automaton 
(X,a) and x € X, the subautomaton generated by x, denoted by (x) C X, is 
the smallest subset of X that contains x and is closed under transitions. We 
calla relation RC X x Y a bisimulation of automata if for all (x,y) € X x Y 
and for all a € A, if (z,y) € R, then (xq, ya) € R. A bisimulation EF C X xX 
is called a bisimulation on X. If E is an equivalence relation, then we call it 
a bisimulation equivalence. The quotient map of a bisimulation equivalence 
on X is an epimorphism of automata gq: X > X/E. For a homomorphism 
h: X > Y, its kernel and image are defined by 


ker(i) = Jie) ex eX | h@) =n) 
im(h) = {yeY|aveX (h(x) =y)}. 


The ker(h) is a bisimulation equivalence on X and im(h) is a subau- 
tomaton of Y and, moreover, X/ker(h) is isomorphic to im(h). 

The set A* forms a pointed automaton (A*,¢,c) with initial state ¢ 
and transition function o defined by concatenation, that is o(w)(a) = wa. 
It is initial in the following sense: for each given automaton (X,qa) and 
every choice of initial state x: 1 > X, it induces a unique homomorphism 
rz: (A*,a) > (X, a), given by r,(w) = tw, that makes the following diagram 
commute: 


(X, a) 


The function rz maps a word w to the state x, reached from the initial state 
x on input w and is therefore called the reachability map for (X,x,q). 
The set 24° of languages forms a coloured automaton (24°27, T) with 
colouring function ¢? defined by ¢?(L) = 1 if and only if e € L, and transition 
function 7 defined by right derivation, that is T(L)(a) = La, where 


La = {uw € A* | aw € LT}. 
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Left derivation is defined analogously. It is final in the following sense: 
for each given automaton (X,qa) and every choice of colouring function 
c: X — 2, it induces a unique homomorphism o,: (X,a) > (24,7), given 
by o-(x) = {w € A* | c(aw) = 1}, that makes the following diagram 


commute: 
2 
an le 
) 


Cc 
(40) St (24 5 


The function o, maps a state x to the language o,(x) accepted by x. Since 
the language o,(x) can be viewed as the observable behaviour of x, the 
function 0, is called the observability map for (X,c,a). Summarising, we 
have set the following scene: 


1 oN | ” 


If the reachability map r, is surjective, then we call (X,2,a@) reachable. 
If the observability map 0, is injective, then we call (X,c,a@) observable. And 
if rz is surjective and 0, is injective, then we call (X,2,c,a) minimal. 


Free and Cofree Automata 


Definition 1 A set of equations is a bisimulation equivalence relation E C 
A* x A* on the automaton (A*,o). We define (X,x,a@) FE E —and say: the 
pointed automaton (X,2z,q@) satisfies E— by: 

(X,n0)F BE = Now) e BE, ay = fu 
We define (X,a) : E —and say: the automaton (X,q) satisfies E— by: 


(Xa EE & VWa:13X, (X,z,a) FL 


We shall sometimes consider also single equations (v,w) € A* x A* and use 
shorthands such as (X,a) F v = w to denote (X,a) FE v=w where v=w is 
defined as the smallest bisimulation equivalence on A* containing (v, w). 
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Definition 2 A set of coequations is a subautomaton D C 24° of the 
automaton (24°,r). We say that the coloured automaton (X,c,q) satisfies 
D, written (X,c,a) E D, when for all x € X, o-(x) € D. We say that 
the automaton (X,q) satisfies D, written (X,a) E D, if for allc: X > 2, 
(X,c,a) — D. 


Let (X,q@) be an arbitrary automaton. We show how to construct an 
automaton that corresponds to the largest set of equations satisfied by (X,q). 
And dually, we construct an automaton that corresponds to the smallest set 
of coequations satisfied by (X, a). 


Definition 3 Let X = {x;}ier be the set of states of an automaton (X,a). 
We define a pointed automaton free(X, a) in two steps as follows: 


(i) First, we take the cartesian product of the pointed automata (X, x;, a) 
that we obtain by letting the initial element x; range over X. This 
yields a pointed automaton (ILX,%,@) with 


ILX = II x 
vw: 13 X 


(where X, = X), EF = (xi )ier, and @ defined component-wise. 
(ii) Next we consider the reachability map rz: A* + ILX and define: 
Eq(X, a) = ker(rz), free(X,a) = A*/Eq(X,a). 
This yields the pointed automaton (free(X, a), [e],[o]). Note that free(X, a) 
is isomorphic to im(rz). 


Definition 4 Let X = {x;}ier be the set of states of an automaton (X,a). 
We define a coloured automaton cofree(X,a) in two steps as follows: 


(i) First, we take the coproduct of the coloured automata (X,c,a) that we 
obtain by letting c range over the set of all maps X —+ 2. This yields a 
coloured automaton (=X, é,@) with 


OX = S- So 


c: X32 


(where X. = X), and ¢ and & defined component-wise. 
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(ii) Next we consider the observability map og: SX — 24° and define: 
coEq(X, a) = im(oe), cofree(X, a) = coEq(X, a). 


This yields the coloured automaton (cofree(X,a),¢?,7). Note that cofree(X, a) 
is isomorphic to UX /ker(oe). 


The automata free(X, a) and cofree(X, a) are free and cofree on (X, a), 
respectively, because of the following universal properties: 


1 Va Ve 2 
a, ye 
free(X,a) - i (X, a) (X, a) - ao cofree(X, a) 


For every point 7: 1 > X, there exists a unique homomorphism from 
free(X, a) to (X, a) given by the x-th projection. Dually, for every colouring 
c: X — 2, there exists a unique homomorphism from (X,a@) to cofree(X, a), 
given by the c-th embedding. 


Proposition 1 The set Eq(X,qa) is the largest set of equations satisfied by 
(X,a). Dually, coEq(X,a) is the smallest set of coequations satisfied by 
(X, a). 


Example 1 Consider the automaton (Z,7) below: 


a 


an) = +L Oe: 


b 


The product over all its possible initial states is given by: 
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Hence im(r(z,)) ts the part reachable from (x,y). We know that 
free(Z, y) is isomorphic to im(r(z,y)), which leads to the following isomorphic 
automaton: 


free(Z,y) = 
c= () i O)=e 
b 
Since free(Z, y) = A*/Eq(Z, 7), we can deduce from the above automaton 
that Eq(Z, y) consists of Eq(Z,7) = {aa =a, bb=b, ab=b, ba=a}. The 


set Eq(Z, 7) is the largest set of equations satisfied by (Z,y). Neat we turn 
to coequations. The coproduct of all 4 coloured versions of (Z,7) is 


a a 
c@L @=:  +>-@lL_ @>: 
b b 


(2Z,é,4) = 
OL TOs: <@L We: 
b b 


The observability map 02: SZ —> 24" is given by 


oe(21) | o2(yr) | oc(x2) | o2(y2) | oe(wa) | oc(ys) | oe(aa) | oc(ya) 
@ | AY | A® | (arb) | (a*b)* | Gra)t | (6a) 


Since cofree(Z, y) = im(oz), this yields 


a 
@>+ <@O@>: 

b 

a 
@>- <@OB>- 

b 


cofree(Z,y) = 
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The set of states of this automaton is coEq(Z,y), which is the smallest 
set of coequations satisfied by (Z, 7). 


Summarising the present section, we have obtained, for every automaton 
(X, a), the following refinement of our previous scene (1): 


Va Ve 
1 2 
FN 
(A*, a) —— free(X, a) ---> (X, a) es cofree(X,a@) — (24°, 7) 


! 


3 A Dual Equivalence 


The purpose of this section is to see how the constructions of free and cofree 
can be regarded as functors between suitable categories. When we restrict 
them to certain subcategories, they form a dual equivalence. To this end, 
we denote: 


A: the category of automata (X,a) and automata homomorphisms, 
Am: the category of automata (X,qa) and automata monomorphisms, 
A-: the category of automata (X,qa) and automata epimorphisms. 


If (X,a) and (Y,8) are two objects in A,, and m is a monomorphism 
between (X,q@) and (Y, 3), we have that Eq(Y, () is contained in Eq(X, a). 
This allows us to define a natural epimorphism free(m) from free(Y, 3) to 
free(X,a). Therefore free: A, + (A-)°? is a functor. 

On the other hand, if e is an epimorphism from (X,a) to (Y,3), we 
have that coEq(Y, 3) C coEq(X,q@) and therefore the inclusion cofree(e) is a 
monomorphism from cofree(Y, 3) to cofree(X, a). Consequently, cofree: An, > 
A-°? is a functor. 


Congruence Quotients 


Let M bea monoid, a right congruence is an equivalence relation £ C MxM 
such that, for all (p,q) € M x M, if (p,q) € E, then, for all m € M, 
(pm,qm) € E. Left congruences are defined analogously. We call E a 
congruence if it is both a right and a left congruence. We denote the set of all 
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congruences on a monoid M by Con(M). If we consider the natural order in 
Con(M) given by inclusion, then Con(/) contains a greatest element given by 
the total relation on M, defined as V yy = M x M, and a least element given 
by the diagonal relation on M, defined as Ay = {(p,q) € M x M | p= 4q}. 

Note that for an alphabet A, the set F is a right congruence on A* if 
and only if it is a bisimulation equivalence on (A*,o). Next we introduce 
the category C of congruence quotients, which is defined as follows: 


objects(C) = {(A*/C, [o]) | C C A* x A* is a congruence relation}; 
arrows(C) = {e: A*/C > A*/D | e is an epimorphism of automata}. 


Note that C is a subcategory of Ae. 


Theorem 2 ([3, Theorem 10]) free(A,,) = C°?. 


Preformations of Languages 
A preformation of languages is a set V C 24° such that: 
(i) V is a complete atomic Boolean subalgebra of 24”, 
(ii) for all L € 24", if L € V, then, for all a € A, both La € V and gL EV. 


We note that, being a subalgebra of 24", a preformation of languages 
V always contains both § and A*. Next we introduce the category PL of 
preformations of languages, as follows: 


objects(PL) = {(V,r) | V C 2” is a preformation of languages}; 


arrows(PL) = {m: V + W | m is an monomorphism of automata}. 


Note that PL is a subcategory of Aj. 


A Dual Equivalence 


The following dual equivalence holds. 


Theorem 3 ((3, Theorem 22]) The category C of congruence quotients 
is dually equivalent to the category PL of preformations of languages via the 
functors free and cofree. 


cofree: C = PL°? : free 
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By Theorem 3, cofree o free(Z, y) is a preformation of languages. From 
the dual equivalence between these objects, if we apply free to the last 
automaton, we will obtain free(Z,y) again. 

In the proof of Theorem 3 one can also see that for w € A*, the equiv- 
alence class |[w] for a given congruence C' can be explicitly computed as 
the behaviour in (A*/C, |[o]) of the initial state [<] under a given colouring. 
It implies that these classes, which are sets of words and, consequently, 
languages in 24", belong to cofree(A*/C, [c]). On the converse, preforma- 
tions of languages (V,7) are complete atomic Boolean subalgebras of al 
having as atoms the corresponding equivalence classes in free(V,7). That 
is, forgetting all the automata structure, on objects we recover the classical 
dual equivalence: 


powerset: Set = CABA®? : atoms 


where CABA stands for the category of complete atomic Boolean algebra with 
homomorphisms preserving arbitrary infima and suprema. As a consequence, 
the cardinality of a finite Boolean algebra, which is always complete and 
atomic, is a power of 2. 


Example 4 (Example 1 continued) Consider our previous example (Z, y): 


eo) = OL TO> 


for which we had computed 


free(Z,y) = 


By Theorem 2, free(Z, 7) is a congruence quotient over A*. 
By a computation similar to the one done in Example 1, we obtain: 
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a 


@—@= 


a,b b 
cofree o free(Z,y) = 
a,b 
@—@>" - 


By Theorem 3, cofree o free(Z,y) is a preformation of languages. From 
the dual equivalence between these objects, if we apply free to the last au- 
tomaton, we will obtain free(Z,y) again. We can represent such Boolean 
algebras by their Hasse diagrams (indicating language inclusion by edges). 
Finally, the following picture includes an example of an epimorphism e and 
its image to illustrate the action of free and cofree on arrows: 


b 
a 


Ome ‘ 


b 


A* 


os 1 a 
x=) eC) =e Se 
b 


) 
cofree 

e| [m 
free 

(el) A 


We end this section presenting a useful consequence of the dual equiva- 
lence. Here we denote by (L) the minimal automaton for a fixed language 
Lew, 
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Proposition 2 ([3, Corollary 23]) For every congruence C in A* and 
every language L in 24", L € coEq(A*/C, |o]) if and only if C C Eq((L),7). 


4 KEilenberg’s Formation Theorem 


4.1 Formations 


In this section we define the notions of formations that we will use in what 
follows. For the sake of simplicity, we write (A*/C) instead of (A*/C, [o]) 
and (L) instead of ((L),7). 


4.1.1 Formations of Monoids 


Definition 5 Following [11, p. 78], we say that a monoid M is a subdirect 
product of a product of a family of monoids (M;)ier if M is a submonoid of 
the direct product |],-; Mj and each induced projection 7; from M onto M; 
is surjective. A monoid P which is isomorphic to such a submonoid M is 
also called a subdirect product of the family of monoids (M;)ier. 


In this case, the projections separate the elements of M, that is, if 
m(x) = m;(y) for alli € J, then x = y. In fact, we have the following 
proposition. 


Proposition 3 ([11, Proposition 3.1]) A monoid M is a subdirect prod- 
uct of a family of monoids (f;: Mi)ier if and only if there is a family of 
surjective morphisms (M — M;)ier which separates the elements of M. 


Subdirect products allow us to introduce the notion of formation of 
monoids, which is a particular case of the most general notion of formation 
of algebraic structures, introduced and studied by Shemetkov and Skiba in 
[26]. 


Definition 6 A formation of monoids is a class of monoids F satisfying: 
(i) every quotient of a monoid of F also belongs to F; 


(ii) the subdirect product of a finite family of monoids of F also belongs to 
F. 


We present some examples of interesting formation of monoids. 


Formations of Monoids, Congruences, and Formal Languages 185 


Example 5 


1. If F is a formation of monoids, then F., defined as the class of all 
monoids in F that are finite is again a formation. 


2. We say that a monoid M has a zero if there exists an element 0 © M, 
such that for every element m € M, the equation m0 = 0m = 0 holds. 
Such an element is unique and thus, one speaks of the zero element. 
The class Z of all monoids with zero is a formation of monoids. 


3. A monoid M is called relatively regular (r-regular for short) (see [15]) 
if it contains a finite ideal. The class R. of all r-regular monoids is a 
formation of monoids. Usual integers with multiplication (Z,-,1) is 
r-regular as it is a monoid with zero. The set Z* of nonzero integers 
is a submonoid of Z without finite ideals. Therefore, R is not closed 
under substructures. 


4. A monoid M is called cyclic if it is generated by one element me M. 
That is M consists of all powers m™ of m (here we use the notation 
m° = 1). If all these powers are distinct, then M is isomorphic to 
the additive monoid of all natural numbers (N,+,0). For a finite 
cyclic monoid M = (m) there is a smallest number n with the property 
m” =m*, for some k > n; n is called the index of the element m (of 
M). In this connection, if r is the smallest nonzero number with the 
property m™ = m"*", then r is called the period of m (of M). The 
pair (n,r) is called the type of m (of M). For any type (n,r) with 
n,r €N andr > 1, the relation: 


One = AnU{(p,q) ENxN|p,g2n andp=q mod r} 


is a congruence on N. The resulting quotient N/@n,, is a finite cyclic 
monoid with type (n,r). Every finite cyclic monoid M with type (n,r) 
is isomorphic to the quotient N/6,,-. A monoid is called periodic if 
all its cyclic submonoids are finite. The set P of all periodic monoids 
is a formation of monoids. A monoid M is called aperiodic if there 
exists a natural number k € N satisfying m® = m**1 for all me M. 
Obviously, aperiodic monoids are periodic. The class A of all aperiodic 
monoids is also a formation of monoids. 


5. A locally finite monoid is a monoid in which every finitely generated 
submonoid is finite. Obviously, locally-finite monoids are periodic. The 
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converse is false: there are even torsion groups that are not locally finite 
(see Burnside problem). The class of all locally-finite monoids, denoted 
by LRin is a formation of monoids. In general, if F is a formation 
of finite monoids. A locally F monoid is a monoid in which every 
finitely generated submonoid belongs to F. The class Le of all locally 
F monoids is a formation of monoids. 


Definition 7 For a monoid M, its residual with respect to a formation of 
monoids F, written CM, is defined as 


Cy =(}{C € Con(M) | M/C € F}. 
The above set is not empty as the total relation V yy is always included. 


Remark 1 In general, the quotient M/c¥ does not necessarily belong to 
the formation F. In fact, for the formation Z,, of all finite monoids with zero, 
the set N of all natural numbers including zero with the usual multiplication is 
a monoid whose residual with respect to Z,, is the diagonal relation. However, 
N is not finite. 


4.1.2 Formation of Congruences 


Definition 8 A formation of congruences is a function F that assigns to a 
set A, a set of congruences on A* satisfying the following conditions. 


(i) for each set A, the set F(A) is a filter in Con(A*); 


(ii) for every two sets A and B, and for every congruence E € F(B) 
with quotient morphism n: B* — B*/E, if there exists a monoid 
homomorphism yp: A* + B* such that the composition nop: A* > 
B*/E is a surjective monoid homomorphism, then ker(n o y) is a 
congruence in F(A). It can be depicted as follows: 


A* ———-» A*/ker(n 0 ¢) 


a 


B* —————» B* /E 
jj i 
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4.1.3. Formations of Languages 


Definition 9 A formation of languages is a function F that assigns to every 
alphabet A a set of formal languages satisfying the following conditions. 


(i) for each alphabet A, if L is a language in F(A), then coEq(A*/Eq(L)) 
is included in F(A); 


(ii) for each alphabet A, if both coEq(A*/C1), coEq(A*/C2) are included 
in F(A), then so is coEq(A*/C1 N C2); 


(itt) for every two alphabets A and B, if L is a language in F(B) and 
n: B* —> free((L)) denotes the quotient morphism, then for each 
monoid morphism yp: A* + B* such that n° ~ is surjective, the set 
coEq(A*/ker(7 0 y)) belongs to F(A). 


The above definition was completely given in terms of equations and 
coequations and from the very beginning it clearly underscores the relation 
between languages and congruences. This will have an impact on the later 
appearing results as it simplifies most of the steps in the proofs. 


4.2 Ejilenberg’s Theorem for Formations of Monoids 


We are now in position to show three different Eilenberg relations for for- 
mations. We first show that formations of monoids are in one-to-one corre- 
spondence with formations of congruences. After this result, we show that 
formations of congruences are in one-to-one correspondence with formations 
of languages. Consequently, formations for monoids and languages have also 
this correspondence. 


4.2.1 Monoids vs Congruences 


Proposition 4 Every formation of monoids F determines, in a canonical 
way, a formation of congruences F. 


Proof: Consider the assignment: 
F: Ar {C € Con(A*) | A*/C € F}. 


Let C, and C2 be congruences in F(A), then A*/C, and A*/C 2 are monoids 
in F. Note that Cy 9 Co is included in C; for 1 = 1,2. If we consider 
the corresponding quotient homomorphisms 7;: A*/C,M C2 — A*/C; for 
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i = 1,2, we have that {7,72} is a family of surjective morphisms separating 
the elements of A*/C, M C3. It follows from Proposition 3 that A*/C1, C2 
is a subdirect product of two monoids in F. Therefore Cy MC is a congruence 
in F(A). Now, if C is a congruence in F(A) and D is a congruence on A* 
with C C D, we have that A*/D is a quotient of A*/C. It follows that 
A*/D is a monoid in F, and we conclude that D is a congruence in F(A). 
Therefore, F(A) is a filter in Con(A*). 

Let A and B be two sets, and let E be a congruence in F(B) with 
quotient morphism 7: B* + B*/E. Let y: A* > B* be a monoid homo- 
morphism such that the composition 7 o y: A* —> B*/E is a surjective 
monoid homomorphism. Hence, A*/ker(7 0 vy) is isomorphic to B*/E, which 
is a monoid in F. It follows that A*/ker(7 0 y) is in F and ker(no y) isa 
congruence in F(A). 


Proposition 5 Every formation of congruences F determines, in a canoni- 
cal way, a formation of monoids F. 


Proof: We take F to be the class of all monoids M for which there exists 
a set A and a congruence C' € F(A) satisfying M = A*/C. We claim that 
this set is a formation of monoids. 

Let f: M —> N be the surjective monoid homomorphism defined on 
a monoid M in F. Then there exists a set A and a congruence C' € F(A) 
satisfying M ~ A*/C. Let y: A* > M be a monoid homomorphism with 
kernel C. Then f oy: A* > N is a surjective monoid homomorphism. 
Moreover, CC ker(f oy), which implies that ker(f oy) is a congruence in 
F(A). Finally, A*/ker(f oy) is isomorphic to N, and so N belongs to F. 

Now, let M be a monoid that can be expressed as the subdirect product 
of a finite family (;);en of monoids in F. Therefore, for each index 7 € n 
there exists a set A; and a congruence C; € Con(A;) satisfying M; = A? /C;. 
Let us denote the corresponding quotient homomorphisms as 7: A} — 
A;/C;. Consider the set B = U,.,, Ai. By the universal property of the free 
monoid, we can construct a monoid epimorphism y;: B* — A? for all 7 € n. 
Thus, 7; 0 yj: B* — A*/C; is a surjective monoid homomorphism for all 
ién. As F is a formation of congruences, the congruence D; = ker(1; © y;) 
belongs to F(B) for all i € n. Note that M can be expressed as the subdirect 
product of the finite family of monoids {B*/D; |i € n}. Since B generates 
each monoid in the family, M is generated by B. It follows that M = B*/F 
for some congruence F' on B*. Since M is a subdirect product of the monoids 
B*/D,;, we have that (),;-,,D; C F. Note that (),-,, Di is a congruence in 


1En iEn 
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F(B) as it is a finite intersection of congruences in F(B). Finally, F is a 
congruence in F(B) and M belongs to F. 

For the next correspondence, we will need one of the most important 
consequences of the universal property of the free monoid. Next Lemma 
states that all free monoids are projective. 


Lemma 1 ([17, p. 10]) For a set A and monoids P and Q, if y: A* > Q 
is a monoid homomorphism and 7: P > Q is a surjective monoid homo- 
morphism, then there exists a monoid homomorphism yp: A* > P with 


nop=y. 


Theorem 6 The mappings FH F and F + F define mutually inverse cor- 
respondences between formations of congruences and formations of monoids. 


Proof: Consider a formation of monoids F. The first correspondence 
gives us the formation of congruences F that assigns to each set A the set 
{C € Con(A*) | A*/C € F} of all congruences whose quotient belongs to F. 
Let H be the class of all monoids M for which there exists a set A and a 
congruence C' € F(A) satisfying M = A*/C. It immediately follows that H 
is included in F. The other inclusion follows easily since every monoid can 
be written as a quotient of a free monoid. 

Now, let F be a formation of congruences. The first correspondence 
gives us F, which is equal to the class of all monoids M for which there exists 
a set A and a congruence C' € F(A) satisfying M = A*/C. Let H denote 
the formation of congruence quotients that assigns to each set A the set 
{C € Con(A*) | A*/C € F}. For a fixed set A, if C is a congruence in F(A), 
then A*/C is a monoid in F and C belongs to H(A). Let C be a congruence 
in H(A), then A*/C is a monoid in F. Therefore, there exists a set B and 
a congruence D € F(B) satisfying A*/C = B*/D. Let 7: B* — B*/D 
and 6: A* — A*/C be the corresponding quotient homomorphisms. Let 
p: A*/C — B*/D be the corresponding monoid isomorphism. It follows that 
7 = poo is a monoid epimorphism from A* onto B*/D. By Lemma 1, there 
exists a monoid homomorphism y: A* > B* with no y = y. Summarising, 
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A —<— arc 


y 
Pi p 
i l 
BY ——____» B*/D 
jj / 


As F is a formation of congruences, ker(7 0 y) belongs to F(A). Finally, 
C is in F(A) as ker(7 0 w) = ker(y) = ker(p06) =C. 


Congruences vs Languages 


Proposition 6 Every formation of congruences F determines, in a canoni- 
cal way, a formation of languages F. 


Proof: Consider the assignment: 
F: Aro | J{coEq(A*/C) |C € F(A)}. 


Let L be a language in F(A), then there exists a congruence C' in F(A) 
for which L is a language in coEq(A*/C). By Proposition 2, we have that 
C C Eq(L). Thus, Eq(Z) is a congruence in F(A). Hence, coEq(A*/Eq(L)) 
is included in F(A). Now, if coEq(A*/C,) and coEq(A*/C2) are included 
in F(A), then the congruences Ci, C2 are in F(A). By assumption, the 
congruence CMC also belongs to F(A). Thus, coEq(A*/C1M C2) is in F(A). 
Let L be a language of F(B) with quotient morphism 7: B* — free((L)). Let 
yp: A* + B* such that 7 0 y is surjective, then ker(7 0 y) is a congruence in 
F(A). Thus, coEq(A*/ker(7 0 y)) belongs to F(A). Hence, F is a formation 
of languages. 


Proposition 7 Every formation of languages F determines, in a canonical 
way, a formation of congruences F. 


Proof: Consider the assignment: 
F: Ar {C € Con(A*) | coEq(A*/C) C F(A)}. 


Let C be a congruence in F(A). If D is a congruence on A* with C C D, then, 
by Theorem 3, coEq(A*/D) is included in coEq(A*/C), which is included 
in F(A) by assumption. Now, let C, and C2 be two congruences in F(A), 
then coEq(A*/C)) and coEq(A*/C»2) belong to F(A). As F is a formation of 
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languages, then coEq(A*/C1M C2) is included in F(A). Hence, C1 1 C2 is a 
congruence in F(A). Hence, F maps each alphabet A to a filter in Con(A”*). 

Let A and B be two sets and let C be a congruence in F(B). Consider 
the corresponding quotient homomorphism 7: B* + B*/C. Let y: A* > B* 
be a monoid homomorphism such that the composition 7 0 y: A* > B*/C 
is a surjective monoid homomorphism. Since F is a formation of languages, 
coEq(A*/ker(7 0 y)) is included in F(A). It follows that ker(7 0 ~) is a 
congruence in F(A). 


Theorem 7 The mappings F — F and F 4» F define mutually inverse 
correspondences between formations of congruences and formations of lan- 
guages. 


Proof: It immediately follows from the assignments we have chosen. 


Languages vs Monoids 


Proposition 8 Every formation of languages F determines, in a canonical 
way, a formation of monoids F. 


Proof: Just consider the composition of the correspondences given by 
Propositions 7 and 5. Hence, we take F to be the class of all monoids M 
that are isomorphic to A*/C for some congruence C’ on A* satisfying that 
coEq(A*/C) C F(A). 


Proposition 9 Every formation of monoids F determines, in a canonical 
way, a formation of languages F. 


Proof: Just consider the composition of the correspondences given by 
Propositions 4 and 6. Hence, for a set A we take the set of languages 


F(A) = |J{coEq(A*/C) | A*/C € F}. 


Theorem 8 The assignments F'> F and F > F define mutually inverse 
correspondences between formations of monoids and formations of languages. 


Proof: It immediately follows from Theorems 6 and 7. 
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5 Discussion and Related Work 


In this section we collect related works and we discuss how our work sub- 
sumes some results on Eilenberg’s theorem and some aspects of the duality 
perspective used in semigroup theory. 


5.1 On Languages 


A strong attempt to present a general result aiming at generalising varieties 
to the more general notion of formation was made in [4], although it was still 
made for formations of finite monoids. Since this result clearly relates to the 
theorems presented here, we will specify the existing connections. Possibly, 
the biggest. difference with the present work is the definition of “formation of 
language” adopted by them, much more in the original spirit of Eilenberg’s 
concept. We reproduce their definition to see that, for regular languages, the 
following definition and definition 9 coincide. In order to avoid confusion, 
we will rename their concept. 


Definition 10 A formation of regular languages is a function R that assigns 
to every alphabet A a set of regular languages over A satisfying: 


(i)’ for each alphabet A, R(A) is closed under Boolean operations and 
derivatives; 


(ii)’ for every two alphabets A and B, if L is a language in R(B) and 
n: B* — free(L) denotes the quotient morphism, then for each monoid 
morphism »: A* — B* such that no » is surjective, the language 
yp !(L) belongs to R(A). 


Clearly, this definition relates with the original definition of Eilenberg 
[6], the main difference being that here the composition 7 o y needs to be 
surjective. We will prove that if F is a formation of languages (in the sense 
of Definition 9) assigning to each alphabet A a set of regular languages, then 
this is equivalent to being a formation of regular languages (in the sense of 
Definition 10). To do so, we will need the following Lemmas. 


Lemma 2 Let R be a formation of regular languages. For an alphabet A, 
if C is a congruence on A* with finite quotient A*/C, then 


coEq(A*/C) CR(A) if and only if [wilco € R(A) for all w € A*. 
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Proof: Let w be a word in A*, for the colouring c,,: A*/C — 2, given 
by cw([ujc) = 1 if and only if [u]c = [w]c, we have that [w]co = 0,,,([E]) is 
a language in coEq(A*/C) which is included in R(A). Conversely, let L be 
language in coEq(A*/C), then L = U{|wjc | w € L}. As A*/C is finite, L 
is a finite union of languages in R(A). Hence, L belongs to F(A). 


Lemma 3 ([4, Proposition 2.14]) Jf L is a regular language over A, then 
every class [w] in A*/Eq(L) can be expressed as follows 


[w] = (Vole | w € yLe} \ tyLe | w ¢ yLe}. 


Theorem 9 Let F be a formation of languages assigning a set of regular 
languages to each alphabet, then F is a formation of regular languages. 
Conversely, if R is a formation of regular languages, then R is a formation 
of languages assigning a set of regular languages to each alphabet. 


Proof: Assume F is a formation of languages (see Definition 9) assigning 
a set of regular languages to each alphabet. 


(i)’ Let Z be a language in F(A), then coEq(A*/Eq(Z)) is included in 
F(A). As coEq(A*/Eq(Z)) is a preformation of languages containing 
L, then the complement and every derivative of LD belong to it. Let 
L[, and Lz be two languages in F(A), then coEq(A*/Eq(Z,)) and 
coEq(A*/Eq(L2)) are included in F(A). It follows that 


D = coEq(A*/[Eq(L1) N Eq(L2)]) 


is also included in F(A). By Proposition 2, we have that L; and L2 
are languages in D, which is a preformation of languages, then [1M La 
and L, U Lz are languages in D. 


(ii)’ Now, consider two alphabets A and B, and let L be a language in 
F(B). Let 7 denote the quotient morphism 7: B* — free(L) and 
let y: A* > B* be a monoid morphism such that 70 y is surjective. 
Then coEq(A*/ker(7 0 y)) is included in F(A). Let L’ be a language 
in (p-1(L)), then there exists some word u € A* with L/ = [p7!(L)Ju. 
Let (v, w) be a pair in ker(7 0 w), then 


Lh = [pe (L)w = {a € A* | woz € op 1 (L)} = {x € A* | y(uvz) € L} 
= {x€ A*| plu)y(v)y(x) € L} = {x € A* | (x) € Lyy(ry}- 
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(i) 


(ii 


(iii) 


Recall that L,,) is a language in (L), and (p(v), p(w)) is a pair in 
Eq(L), therefore: 


Li, = {£ eA’ | p(x) € Leta t {z ear | v(x) € Lgenatuys = Die 


It follows that ker(n oy) C Eq(y~!(L)). Again by Proposition 2, we 
have y~1(L) is a language in coEq(A*/ker(7 0 y)). 


Consequently, F is a formation of regular languages. 
Now, let R be a formation of regular languages (see Definition 10). 


Let L be a language in R(A). It is well known that a language L over 
an alphabet A is regular if and only if the set {,Ly | v,w € A*} is 
finite. Let [u] be an element in A*/Eq(L), then it holds by Lemma 3 
that 


[u] = (He | ue yLw} \ foLu | u ¢ vbw}. 


That is, every atom in coEq(A*/Eq(L)) belongs to the Boolean algebra 
generated by the derivatives of L. It follows from Lemma 2 that 
coEq(A*/Eq(L)) is included in R(A). 


Now, assume that coEq(A*/C 1) and coEq(A*/C2) are both included 
in R(A). Let [w]c,ac, be an atom in coEq(A*/CiN C2). The equation 
[wloincs = [wlc, O [wlc, trivially holds. Note that [w]c, is a language 
in coEq(A*/C;) for i = 1,2, and hence, included in (A). We conclude 
that [w]o,ac, is a language in R(A). By Lemma 2, coEq(A*/C1 NM C2) 
is included in R(A). 


Finally, consider two alphabets A and B, and let L be a language in 
R(B). Let 7 denote the quotient morphism 7: B* — free(Z) and let 
y: A* + B* be a monoid morphism such that 7 0 y is surjective. We 
have that y~!(L) is a language in R(A), hence, by the first item of this 
proof, we have that coEq(A*/Eq(y~!(Z))) is included in R(A). Let us 
check ker(n 0 y) = Eq(y~!(L)). We will only check that Eq(y~1(Z)) is 
included in ker(7 0 y); for the other inclusion, see the proof done in the 
first part of this Theorem. Let (v, w) be a pair in Eq(y~!(L)), we claim 
that (y(v), p(w)) is a pair in Eq(L). As no ¢ is surjective, for every 
word u € B*, there exists some word u’ € A*, with (u, y(u’)) € Eq(L). 
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Let L,, be a language in (L). 


Lugo) = Le(uelw (1 oy surjective) 
= Ly) (monoid homomorphism) 
= {xz € B* | y(u'v)z € L} 
= 1% EB* lee Ligtulivet 


={ze B | € © Le wusetey} (n° y surjective) 
= {xz € B* | y(u'v2’) € L} (monoid homomorphism) 
= {x € B*|u'vz' eg \(L)} 

= {2 ¢ B*|u'w2' cy" (L)} ((v,w) € Eq(y*(L))) 
= Lug(w)) 


thus, ker(7 oy) = Eqly 1 (L)). 


We conclude that R is a formation of languages. 

Recall that for the first implication in Theorem 9 no direct appeal to 
the regularity on the languages was needed. Therefore, we have the following 
corollary. 


Corollary 1 Let F be a formation of languages, then 


1. for each alphabet A, F(A) is closed under Boolean operations and 
derivatives; 


2. for every two alphabets A and B, if L is a language in F(B) and 
n: B* — free(L) denotes the quotient morphism, then for each monoid 
morphism yp: A* —+ B* such that no » is surjective, the language 
yp 1(L) belongs to F(A). 


A direct consequence of Theorem 9 is the Eilenberg result for formations 
of finite monoids presented in [4]. Therefore, our work subsumes this result 
as we do not require the monoids to be finite nor the languages to be regular. 


5.1.1 An Application to Non Relatively Disjunctive Languages 


In this subsection we present a direct application of Theorems 7, 8, and 
6. The following results can be found in [30]. All the notation appearing 
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there has been translated to our notation. We denote the cardinality of 
a language L over A by |L|. We call a language L over A disjunctive if 
Eq(L) is the diagonal relation on A*. We call a language L over A dense 
if A*wA* OL #4 @ for every w € A*; otherwise, the language L is said to 
be thin. According to Reis and Shyr, a language L is dense if and only 
if L contains a disjunctive language [21]. We shall call a language over A 
relatively f-disjunctive [relatively disjunctive] (rf-disjunctive |r-disjunctive] 
for short) if there exists a dense language D over A such that for all w € A* 
fwleq¢zy VD] < Xo, [[fwleq¢z) VP] <1). 
It has been shown in [12] that L is rf-disjunctive if and only if L is r- 
disjunctive and therefore, the two previous concepts are equivalent. The 


next result can be found in [15]. It relates r-disjunctive languages and 
r-regular monoids (see example 5). 


Theorem 10 A language L over A is r-disjunctive if and only if A*/Eq(L) 
is not r-regular. 


We shall denote by D(A) the set of all non-r-disjunctive languages 
over A. As a consequence of our previous section, we have a result on 
disjunctive languages that does not follow immediately from the definition. 


Corollary 2 The function ND: A> ND(A) is a formation of languages. 


Proof: It follows from Theorem 8 and the contrapositive version of 
Theorem 10. 
Another interesting result that we can find in [12] is the following. 


Theorem 11 Let L be language over an alphabet A, then L is r-disjunctive 
if and only if either A* has no dense Eq(L)-classes or has infinitely many 
dense Eq(L)-classes. 


We shall denote by D,(A) the set of all congruences on A having n 
dense C-classes, for n € N with n > 1. We have the following corollary. 


Corollary 3 The function D,: A— D,(A) is a formation of congruences. 


Proof: It follows from Theorem 7 and the contrapositive version of 

Theorem 11. 
As we did in example 5, the class of all r-regular monoids shall be 

denoted by R. All in all, we have a the following correspondences. 


R s ND s&s D 
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Example 12 We define the non-finite automaton (Nx 2, a) over the alphabet 
A = {a,b} whose set state is given by N x 2 and whose transition function 
a is given by the following equations. Forn EN, 


(n, 0)a = (n+ 1,0), (9,0)5 = (1); 


_ (0, 1), af n = 0; 
Om Dn= 4 (n—1,1), if n>O. 


(n, 1a = (0, 1} 


It can be depicted as follows. 


Consider the colouring 6(1,1): N x 2 — 2 defined by 6(.,1)(n,t) = 1 af 
and only if (n,t) = (1,1). Let L denote the language accepted by (0,0), that 
is L= 08.4; (0; 0). It is straightforward to see that, in this case, we obtain 
the classical example of a non-regular language, 


L= {a"b" |ne N}. 


An easy calculation shows that the automaton (N x 2,a) is isomorphic 
to (L). It follows that L satisfies non-trivial equations like 


ab = a7’, a‘ba = ba, or ababab = bab. 


The construction of the free automaton associated to (L) leads to the 
following automaton. 
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Note that the class [ba] is a zero element of the monoid A*/Eq(L), 
consequently A*/Eq(L) is r-regular as it contains a finite ideal which is 
precisely {|ba]}. By Theorem 10 L is a non-r-disjunctive in ND(A). Note 
that [ba] is the unique dense Eq(L)-class. 

Some interesting statements can be further derived from Corollary 2. 
Consider for example the alphabet B = {x,y,z,t}. The function ¢: B— A* 
given by 

rH a’, yr ab, ze b, tra, 


induces a surjective monoid homomorphism yp: B* + A*. Since L = {a"b" | 
n € N} is non-r-disjunctive, we can use Corollary 1 to conclude that the 
language 


yp *(L) = {w2* |w € {a,t}*, k = 2|wl, + |wle} U 
{wyz* |w € {x,t}*, k = 2|wle + |wle}, 


where |w|, is defined as the number of occurrences of the letter 1 in w, is a 
non-r-disjunctive language and, again by Theorems 10 and 11, we conclude 
that its syntactic monoid contains a finite ideal and there exists a finite 
number n, with n > 1, of dense Eq(y~*(L))-classes. 


5.2 On Congruences 


Certainly, the congruence approach we adopt in this work has been explored 
in other references. The most significant effort known to us was made by 
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Denis Thérien in his PhD thesis [28] (a summary can be found in [29]). 
Thérien considers the problem of providing an algebraic classification of 
regular languages. This problem was previously considered, although using a 
different approach, by Straubing [27] whose method involves counting certain 
factorizations of words. Thérien’s approach, on the other hand, makes use 
of congruences of finite index. The first main theorem in [28] subsumes 
Eilenberg’s original theorem by expressing the conditions defining varieties 
in terms of congruences. 


Definition 11 ([28]) A *-variety of congruences is a function A that as- 
signs to an alphabet A, a set of finite index congruences on A* satisfying the 
following conditions. 


(i) if 01,02 € A(A), then 01M 62 € A(A); 


(ii) for every two sets A and B and for every congruence 8 € A(B), for 
every monoid homomorphism 0: A* + B* and 6: B*/8 + T fora 
monoid T, then 080’ € A(A). 


Thérien extends Eilenberg’s theorem by proving that *-varieties of congru- 
ences are in one-to-one correspondence with varieties of regular languages 
and pseudovarieties of monoids. The main difference between *-variety of 
congruences and formations of congruences (see Definition 8) is that in a 
variety, only finite index congruences are required. Moreover, the composi- 
tion of the corresponding homomorphisms in item ii) is not required to be 
surjective. 

The congruence approach is very helpful because it is fundamentally 
constructive and one can systematically generate *-varieties of congruences of 
increasing complexity. The idea behind Thérien’s method to provide arbitrary 
assignments of congruences is captured in the following proposition. 


Proposition 10 Let B be an alphabet and let 0 be a relation on the free 
monoid B*. Let A be an alphabet and let § = (yi: A* > B*)ier be a family 
of monoid homomorphisms indexed over a set I. Consider the relation 03 
on A* defined as follows. For words v,w € A*, 


vOzw if and only if = for alli cI, yi(v)Oy;(w). 


If 0 is a congruence on B*, then 03 1s a congruence on A*. 
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Thérien uses the free monoid on one generator (N,+,0) and, for an 
alphabet A, the family of monoid homomorphisms F = (|: |a: A* > N)aea, 
where |w|q is defined as the number of occurrences of the letter a in w. 
Recall that finite index congruences on N are completely determined by 
the congruences 6, (see item 4 in Examples 5). In this case, the basic 
congruences Qn, used by Thérien are described as follows. For words 
v,w € A’, 


Vn pW ifand only if forallae A, |vla An, |wla- 
Then the image of the function A,,, on an alphabet A is defined as 
An,r(A) = {8 € Con(A) | @ has finite index and an,, C 6}. 


Clearly A, is a *-variety of congruences. Some monoid characterizations 
for these varieties are easily derived. Let Mcom be the pseudovariety of 
commutative monoids and let My,» be the pseudovariety of all monoids MW 
in which m”*" = m” for all m € M. Then we have the following theorem. 


Theorem 13 ([28, Theorem 1.2]) For anyn > 0 andr > 1 the *-variety 
of congruences A, and the pseudovariety Mcom M1 Mn are in correspon- 
dence, that is, a finite monoid A*/@ belongs to Mcom™ Mn if and only if 
6 € A, ,(A). 


This definition could be further extended to count subwords as well. 
All in all, Thérien’s finite index congruences are defined by four parameters; 
the length m of the subwords that are counted, the depth 7 of the recursion, 
and the type indices n and r of the congruence 6, on N. Moreover, these 
families of congruences AG can be ordered by inclusion according to the 
values m,i,n and r, thus providing hierarchies of increasing complexity. 

Some interesting pseudovarieties were thus characterized in such a way. 
Among others, Thérien characterizes the pseudovarieties of commutative 
aperiodic monoids, commutative groups, commutative monoids, J-trivial 
monoids, nilpotent groups, and solvable groups of derived [fitting] length 
< k. Not all monoids, however, can be characterized in such a way. Left 
out of this classification are all non-cyclic simple groups, and consequently, 
the monoids containing such groups. Certainly, one of the most important 
problems to solve in the congruence approach is to constructively describe 
all regular languages. 
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5.2.1 The Profinite Approach 


An inherent difficulty in dealing with pseudovarieties of monoids is that in 
general they do not have free objects as these are usually infinite structures. 
To avoid this restriction, we need some kind of limiting process. The 
appropriate construction is the projective (or inverse) limit of the desired 
family of monoids in the wider setting of topological semigroups (see [2, 24, 
23)). 

By a topological semigroup we mean a semigroup S endowed with a 
topology such that the binary operation S x S — S' is continuous. This is 
not a problem in the finitary case since a finite semigroup can be always 
viewed as a topological semigroup under the discrete topology. A profinite 
semigroup is then defined to be the projective limit of a projective system of 
finite semigroups. In general, if V is a pseudovariety of semigroups a pro-V 
semigroup is defined to be the projective limit of a projective system of 
semigroups in V. The most interesting example, in our case, is constructed 
as follows. For a generating alphabet A, we take the projective limit of all 
A-generated members of V up to isomorphism. It determines a projective 
system by taking the unique connecting homomorphisms with respect to 
the choice of generators. Following [2], the projective limit of this system is 
denoted 24V. This profinite semigroup is relatively free in the sense that 
the natural mapping 2: A > 0.4V is such that, for every mapping y: A > S$ 
into a pro-V semigroup there exists a unique continuous homomorphism 
@: DAV = S satisfying Go1= y. 

This object can also be constructed by defining a natural metric on 
A* (see [19, 9]). This metric measures the strength of the monoids in V to 
discriminate words in A*. For a finite alphabet A, the completion of A* with 
respect to this metric coincides with 14V. The completion 1.4 V inherits 
a semigroup structure where the product of two elements s,t € Q4V is 
defined by taking any sequences (8,,)ncn and (tn)ncn converging respectively 
to s and ¢ and noting that (S,tn)nen is a Cauchy sequence whose limit st 
does not depend on the choice of the two sequences. This gives Q4V the 
structure of a topological semigroup. 

A pseudoidentity is then a formal equality u = v, with u,v elements in 
a profinite monoid Q4V. Thus, a monoid S satisfies a pseudoidentity if for 
every continuous homomorphism vy: 24V — S, the equality y(u) = y(v) 
holds. One of the most important theorems in this field was proved by 
Reiterman [22], which states that pseudovarieties of semigroups can be 
defined by pseudoidentities. 
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To give an example illustrating Reiterman’s theorem, we now describe 
some important unary implicit operations on finite semigroups. For a finite 
semigroup S, s € S, and k € Z, the sequence (s”'**),,cj becomes constant for 
n sufficiently large. Hence, in a profinite semigroup, the sequence (o*), any 
converges and we denote its limit by s“+*. From the above unary implicit 
operation and multiplication one may already easily construct several implicit 
operations such as x’y” or (xy)”. As an example, the pseudovariety A of all 
finite aperiodic semigroups is defined by the pseudoidentity «+! = x” since 
all subgroups of a semigroup are trivial if and only if all its cyclic subgroups 
are trivial. Some other pseudoidentities defining interesting pseudovarieties 
are known in the literature [18, 2, 19, 23]. 

These results should be compared with the results appearing in [28]. 
Returning to our previous discussion, Thérien introduces new families of 
*_varieties of congruences as follows. For an alphabet A, he defines the 
following set of congruences 


A,.7(A) = {8 € Con(A*) | 6 has finite index and dn € N with 6 € A,,,(A)}, 


and proves that A, is a *-variety of congruences. An immediate result is 
presented. 


Theorem 14 ([28, Corollary 1.1]) The *-variety of congruences A,.1 and 
the pseudovariety AM Meom are in correspondence, that is, a finite monoid 
A*/6@ belongs to AN Meom if and only if 0 € A, 4(A). 


It says that a finite aperiodic commutative monoid M, considered in 
the form of a congruence quotient A*/@ for an alphabet A, is aperiodic and 
commutative if there exists n € N with a, C 6. In particular, for every 
word u € A*, the words u” and u"*! are always a,1-related and, hence, any 
finite aperiodic commutative monoid satisfies the equation 2” = 2”*! for 
sufficiently large n € N. This approach remind us of the original treatment of 
pseudovarieties of monoids done by Eilenberg [6, 20], where he recognised that 
pseudovarieties could be characterized by infinite sequence of identities, with 
each semigroup satisfying all but finitely many identities in each sequence. 
This idea requires a further exposition. Let V be a pseudovariety of monoids 
and let A be an alphabet. For n € N, consider the following set of congruences 
over A*, 

Cr, = {8 € Con(A*) | |A*/0| < n and A*/@€ V}. 


Since there exists up to isomorphism finitely many A-generated monoids 
in V with cardinal less or equal than n, the set of congruences C’, is finite. 
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It follows that for each n € N, the relation 
On =( \{@ € Con(A*) | 0 € Cn} 


is a congruence on A* whose quotient is a monoid in V (note that A*/O, 
is a finite subdirect product of all monoids A*/6 with 0 € C;,) and for each 
6 € C,, there exists a surjective monoid homomorphism from A*/O,, to A*/6. 
Moreover, such congruences are ordered in a sequence 


Oo D 01 D On 2D::- 


and, as Eilenberg suggested, every A-generated monoid in V satisfies all but 
finitely many identities in the sequence. Note that for every word w € A* if 
we take a class representative wy, in [wle,, with n € N, the sequence (wp) nen 
is a Cauchy sequence whose limit belongs to 04V. In this case, the set 


V(A) = {6 € Con(A*) | 6 has finite index and A*/@ € V} 


with reversed inclusion is a projective onto system that induces the same 
projective systems of finite monoids required in the construction of the 
relatively free pro-V monoid Q4V. 

What we just proved is that these kind of projective systems contain a 
simple cofinal sequence. Following [13] a directed set (1, <) is called special 
if every inverse onto system of type (J,<) has nonempty inverse limit. The 
main theorem in [13] states the following. 


Theorem 15 ([13]) The necessary and sufficient condition for a directed 
set (I,<) to be special is that it possess either a maximal element or a simple 
cofinal sequence. 


This theorem can be exploited in two directions; the first one to recover 
a purely algebraic description of the relatively free pro-V monoid without 
direct appealing to topological arguments in its construction. Note that 
most of these topological arguments are used to guarantee the existence of 
elements in the projective limit in the sense of Henkin’s theorem (see [23]). 

The second direction is more ambitious; Theorem 6 opens up the 
possibility of an “equational” description of formations of monoids in the 
same sense of Reiterman’s theorem for pseudovarieties. This result would 
mimic the process we just presented for the case of pseudovarieties and 
*_varieties of congruences. 


204 ~~ Ballester-Bolinches, Cosme-Llépez, Esteban-Romero, Rutten 


For a formation of monoids F and an alphabet A, the set 
F(A) = {0 € Con(A*) | A*/0 € F} 


ordered with reversed inclusion is also a directed set. By Theorem 4 F is 
a formation of congruences and, consequently, F(A) is a filter in Con(A*). 
Henkin’s Theorem 15 would require this directed set to have a cofinal 
sequence or a maximal element to guarantee the construction of a relatively 
free pro-F monoid. Consequently, the equations would be pairs of elements 
in this relatively free monoid with a similar notion of satisfaction of equation 
as in pseudovarieties. This approach faces a big problem; it relies on Henkin’s 
theorem and we would need to investigate necessary and sufficient conditions 
on a filter to have cofinal sequence or maximal element (with respect to the 
reversed order). There are, possibly, other options to face this problem but, 
certainly, a simple adaptation of the topological approach seems useless as 
it clearly relies on the finiteness of the monoids. 

It would be also interesting to extend the results on duality theory 
presented by Gehrke, Pin and Grigorieff on regular languages. One of the 
most interesting consequences of their work is a theorem stating that lattices 
of regular languages, that is sets of regular languages closed under finite 
intersection and finite union can be defined by sets of profinite inequations 
(9, Theorem 5.2]. It is intimately based on the connection between duality 
theory and the algebraic theory of finite state automaton presented in detail 
in [7, 8]. This result is an instantiation of the duality between sublattices in 
the set Reg(A*) of all regular languages over an alphabet A and preorders 
on its dual space A, the relatively free profinite monoid. A corollary of this 
result is precisely Reiterman’s original result [9, Corollary 5.3]. All in all, 
this line of research work deserves further attention. 


5.3. On Varieties 


This work is clearly related with a variant of Eilenberg’s variety theorem [6] 
that was presented in [3]. There, varieties of finite monoids were replaced 
by varieties of monoids (as stated by Birkhoff [5]) and varieties of regular 
languages were replaced by varieties of languages. The definition of variety 
of languages is given in terms of equations and coequations as we did in 
Definition 9. We recall these definitions. 


Definition 12 A variety of monoids is a class of monoids V satisfying the 
following conditions. 
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(i) every homomorphic image of a monoid of V belongs to V; 
(iit) every submonoid of a monoid of V belongs to V; 


(itt) the direct product of every family of monoids of V also belongs to V. 


Birkhoff proved two main results; the characterisation of varieties by 
sets of identities and the closure conditions a set of algebras must satisfy in 
order to be a variety. As a consequence, varieties of monoids are equationally 
defined classes of monoids [16, 5]. The following theorem of Kogalovskii [14] 
(see also [16, 10]) characterises varieties of monoids in terms of quotients 
and subdirect products. 


Theorem 16 A class of monoids V is a variety if and only if it is closed 
under taking arbitrary subdirect products and quotients. 


Consequently, the main difference between varieties of monoids and 
formations of monoids is that in a variety, arbitrary subdirect products are 
allowed. In fact, Kogalovskii proved that from the closure under quotients 
and arbitrary subdirect products we retrieve closure under submonoids. To 
mimic this property and bearing in mind an Eilenberg result for varieties, 
the following definition was presented in [3]. 


Definition 13 A variety of languages is a function V that assigns to every 
alphabet A a set of formal languages satisfying the following conditions. 


(i) for each alphabet A, if L is a language in V(A), then coEq(A*/Eq(L)) 
is included in V(A); 


(ii) for each alphabet A, if the set {coEq(A*/C;) | 7 € I} is included in 
V(A), then so is coEq(A*/(),<7 Ci); 


(iti) for every two alphabets A and B, if L is a language in F(B) and 
n: B* — free((L)) denotes the quotient morphism, then for each 
monoid morphism pp: A* — B* such that n° ~ is surjective, the set 
coEq(A*/ker(7 0 y)) belongs to F(A). 


Here, we require closure under arbitrary intersection of congruences to 
mirror the respective closure under arbitrary products in the definition of 
variety of monoid (see Definition 9). It was shown in [3] that varieties of 
languages are in one-to-one correspondence with varieties of monoids. These 
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differences have also a counterpart in the congruence side. Among other 
particularities for varieties, the residual of a monoid is always a congruence 
whose quotient is a monoid in the variety. 


Proposition 11 If V is a variety of monoids, then for every monoid M, 
the quotient M/C¥ is a monoid in V. 


Proof: Note that M/ Ce is the subdirect product of the family of all 
quotients of M in V. Kogalovskii’s Theorem 16 guarantees us that this 
subdirect product is in V. 


As a consequence, if V is a variety of monoids, the assignment of 
Proposition 4 maps each alphabet A to a principal filter. In this case, the 
principal filter generated by the residual of A* over V 


V: At-> {C € Con(A*) | A*/C € V} = [CH ). 


In the case of varieties, Theorem 6 gives a correspondence between 
varieties V and formations of congruences V satisfying that for all A, the set 
V(A) is a principal filter in Con(A*). That is, a monoid A*/@ belongs to a 
variety V if and only if Cr C @ or, equivalently, it satisfies some equations 
(being precisely the pairs of words in Ce.) which is precisely one of the 
main results in Birkhoff’s theorem. In this case, the residual is a maximum 
element with respect to reversed inclusion of all congruences in V(A) (see 
the discussion of Theorem 15). Again, this result clearly underscores the 
connection between congruences and equations on the description of varieties 
of monoids. 
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